Using transport diagnostics to understand Chemistry Climate Model ozone simulations 
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Abstract. We demonstrate how observations of N 2 0 and mean age in the tropical and midlatitude lower 
stratosphere (LS) can be used to identify realistic transport in models. The results are applied to 15 
Chemistry Climate Models (CCMs) participating in the 2010 WMO assessment. Comparison of the 
observed and simulated N 2 0/mean age relationship identifies models with fast or slow circulations and 
reveals details of model ascent and tropical isolation. The use of this process-oriented N 2 0/mean age 
diagnostic identifies models with compensating transport deficiencies that produce fortuitous 
agreement with mean age. 

We compare the diagnosed model transport behavior with a model's ability to produce realistic LS 0 3 
profiles in the tropics and midlatitudes. Models with the greatest tropical transport problems show the 
poorest agreement with observations. Models with the most realistic LS transport agree more closely 
with LS observations and each other. We incorporate the results of the chemistry evaluations in the 
SPARC CCMVal Report (2010) to explain the range of CCM predictions for the return-to-1980 dates for 
global (60°S-60°N) and Antarctic column ozone. Later (earlier) Antarctic return dates are generally 
correlated to higher (lower) vortex Cl y levels in the LS, and vortex Cl y is generally correlated with the 
model's circulation although model Cl y chemistry or Cl y conservation can have a significant effect. In 
both regions, models that have good LS transport produce a smaller range of predictions for the return- 
to-1980 ozone values. This study suggests that the current range of predicted return dates is 
unnecessarily large due to identifiable model transport deficiencies. 
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1. Introduction 


Chemistry climate models (CCMs) are the current state-of-the-art tools used to assess stratospheric 
ozone and make predictions of its future evolution (World Meteorological Organization, in press; WMO, 
2007). Ozone distributions are controlled by transport, chemistry, and temperature (i.e., dynamics and 
radiation). In the stratosphere, the processes that control ozone are expressed by the ozone tendency 
equation, 

d0 3 /dt = Transport + P - L(O x ) + L(NO x ) + L(CI X ) + L(Br x ) + L(HO x ), 

where P is 0 3 production and the L-terms are loss processes due to different radical families. In the 
lower stratosphere 0 3 chemistry is slow and distributions are controlled primarily by transport. In the 
middle and upper stratosphere, photochemistry is fast but transport still plays an important role 
because it controls the distributions of long-lived families that produce radicals involved in 0 3 loss 
processes (Perliski et al., 1989; Douglass et al., 2004). Photochemistry and temperature control the 
steady state balance between radicals and their precursors, e.g., NO x /NO y and CIO x /Cl v . Transport and 
chemistry in a model must both be physically realistic to produce a credible simulation of 0 3 . 

The use of 0 3 as a measure of realism in a simulation is fraught with problems. In some cases, 
compensating deficiencies in the processes affecting 0 3 produce a realistic result. In other cases, an 0 3 
profile or column may be insensitive to some of the terms in the tendency equation. For example, 
Douglass et al. (1997) noted that while their model's 0 3 profiles agreed with observations from several 
UARS instruments, their simulated long-lived tracer profiles did not. The tracer profiles were poor due to 
the dependence on horizontal transport, which was excessive in the model. The good agreement of the 
simulated ozone with observations indicated that the sum of 0 3 loss processes was reasonable, but as 
the long-lived tracers were too high, the relative fractional losses from different cycles (i.e., NO x , HO x , 
CIO x , and O x ) were probably incorrect (Douglass et al., 2004). Comparisons with total column 0 3 
observations of the recent past have been used in previous ozone assessments as an indication of model 
performance (Stratospheric Processes and their Role in Climate (SPARC) Chemistry Climate Model 
Validation (CCMVal), 2010; WMO, 2007; WMO, 2003). Andersen et al. (2006) examined 10 2-D and 4 3-D 
0 3 simulations used in the 2007 WMO assessment and found no correlation between the past and 
future column ozone trends simulated by individual models. They noted that overestimates of 0 3 trends 
in one latitude region compensated for underestimates in other regions giving a false agreement 
between the model 60°S-60°N column 0 3 trend and the trend from observations. The use of column 0 3 


3 



as a measure of model performance goes back at least 20 years (e.g., Jackman et al., 1991) and is still 
used today (e.g., Austin et al., 2010). 

Douglass et al. (1999) proposed setting standards for model evaluation that were based on objective 
comparisons with observations, including quantitative scoring. This approach became feasible in the 
1990's with the availability of multi-year, near-global stratospheric trace gas data sets from satellite- 
based instruments such as those on the Upper Atmosphere Research Satellite (UARS). This approach is 
physically based and therefore may identify areas where model improvement is needed. Objective, 
observationally-based evaluations are also advantageous because they provide a way to quantitatively 
reassess a model after improvements have been made. Today, more than 10 years since Douglass et al. 
(1999), many satellite data sets are available for the development of model diagnostics. Balloon and 
aircraft data sets and ground-based networks also provide valuable observations for the development of 
model evaluation tools. 

Recently, Eyring et al. (2006) and Waugh and Eyring (2008) adopted a physically-based approach for 
evaluating CCM simulations by assessing the representation of processes that affect stratospheric 
ozone. They posit that diagnosing transport and dynamical processes in CCMs, rather than 0 3 , is a 
meaningful way to evaluate a model's ability to make reliable projections of future composition. This 
approach was applied to the CCMs in the 2007 WMO assessment (WMO, 2006) to help interpret model 
predictions of future 0 3 levels. The Stratospheric Processes and their Role in Climate (SPARC) Chemistry 
Climate Model Validation (CCMVal) Report (2010, hereinafter referred to as SCR)) builds on this 
approach and represents the most comprehensive effort to-date toward evaluating model processes by 
developing observationally based diagnostics for radiation, dynamics, transport, and chemistry. The 18 
CCMs participating in the most recent WMO assessment (WMO, in press) were evaluated in this report 
and are listed in Table 1. Details of the chemistry, transport, dynamics, and radiation representations in 
these models, along with details of the reference simulations performed, can be found in Morgenstern 
et al. (2010). 

The transport evaluation in the SCR concluded that tropical ascent and subtropical (horizontal) mixing 
were two fundamental processes that strongly affect the distributions of ozone and ozone-depleting 
substances. By mixing we mean irreversible transport of extratropical air into the tropics and not the 
export of tropical air to the midlatitudes, which is required by mass conservation because pressure 
(mass) decreases with altitude. The SCR concluded that at least half of the participating CCMs had 
significant issues with lower stratosphere (LS) transport, that is, circulation, mixing, or both. A useful 
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transport metric developed in the SCR was the average mean age (AMA). The AMA grade is a measure of 
how well a model agrees with mean age observations at 7 different altitudes and latitudes. Mean age 
depends on both circulation (i.e., tropical ascent rate) and mixing into the tropics, and on the balance 
between them as a function of altitude. Requiring a model to have realistic mean age at many different 
altitudes and latitudes was shown to be the most stringent transport diagnostic yet developed. 
Unfortunately, mean age at pressures other than 50 hPa (~20 km) has only been measured at a few 
latitudes, so the mean age data set necessary to fully constrain model LS transport does not yet exist. 

In the lower stratosphere, mean age is generally less than 4 yrs and is strongly correlated with N 2 0. The 
correlation between simulated N 2 0 and mean age was described in Hall et al. (1999) in an evaluation of 
more than a dozen models. Although there was a wide variation in model mean ages, most differing 
greatly from observations, they found that the models' N 2 0/mean age relationships were qualitatively 
similar. Mean age and N 2 0 are correlated because the longer air remains in the stratosphere, the older it 
gets and the more N 2 0 is photochemically destroyed. There is a limit to this relationship because as 
transport time in the stratosphere increases, the maximum altitude attained increases and all N 2 0 is 
eventually destroyed, ending the correlation. For most of the lower stratosphere, N 2 0 is strongly 
correlated with mean age because its mixing ratios reflect a mixture of young (high N 2 0) air that has not 
travelled to the loss region (~5 hPa and above) and older (low N 2 0) air that has experienced 
photochemical loss. 

In this paper, we show how global observations of N 2 0 can be combined with existing mean age 
observations from 150-30 hPa to extend the usefulness of the AMA concept by using N 2 0 as a proxy for 
age in the LS. The N 2 0/mean age analysis is applied to 15 CCMs and is interpreted in terms of the 
models' representation of tropical ascent and isolation. Comparison of these results with 0 3 profile data 
investigates the physical link between this transport diagnosis and model 0 3 simulations in the tropics 
and midlatitudes below 30 hPa. Since much of the ozone column resides in the lower stratosphere, 
realistic representation of lower stratospheric transport is an essential component of simulating realistic 
total column ozone. We examine predictions of the return-to-1980 ozone columns for the 15 CCMs that 
simulated the 21 st century. Using the transport diagnostics presented here along with some of the 
CCMVal chemistry evaluations, we can explain at least half of the range of model-predicted return dates 
for global (60°S-60°N) and October Antarctic ozone columns. This study suggests that the current range 
of predicted return dates is unnecessarily large due to identifiable modeling deficiencies. While there 
are significant uncertainties in the return dates due to the unknown levels of future ODS and GHG 
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emissions, the use of transport and chemistry diagnostics to identify models with credible LS transport 
and photochemistry may reduce the uncertainty in return dates caused by unrealistic representation of 
important processes controlling stratospheric ozone. 


2. N 2 0 and Mean Age Observations used in the Transport Diagnosis 

In this section we review the observations of N 2 0 and mean age that are used to develop empirical 
constraints on lower stratospheric transport. Satellite observations of N 2 0 from the Atmospheric 
Chemistry Experiment (ACE) satellite instrument onboard SCISAT-1 (2004-2008) (Jones et al., in 
preparation) and from the Microwave Limb Sounder (MLS) satellite instrument onboard Aura (2004- 
2009) (Livesey et al., 2007; Lambert et al., 2007) are used to create an annual mean climatology for each 
5-year data set. The use of a 5-year mean reduces the effect of the quasi-biennial oscillation on LS N 2 0 
distributions. The MLS v2.2 N 2 0 observations have daily 82°S-82°N coverage in the stratosphere 
extending to 100 hPa. The ACE v2.2 N 2 0 has latitudinal coverage that varies during the year; as a result 
the annual mean climatology is restricted to 68°S-68°N. Levels of 150 hPa and above are used here. We 
also use mean age derived from C0 2 observations from balloon (Engel et al., 2009; Andrews et al., ACE 
2001) and aircraft (Andrews et al., 2001) campaigns. Figure 1 shows zonal annual mean MLS N 2 0, ACE 
N 2 0, and their percentage difference. Where these data sets have spatial overlap, 68°S-68°N, and at the 
pressure levels of interest in this study, 100-20 hPa, ACE and MLS differences are almost always less 
than 10%; for most of the LS they are less than 5%. In the ACE N 2 0 validation study, Strong et al. (2008) 
report that the mean profile differences between ACE and MLS is ±5% from 1-100 hPa, with MLS 
showing low bias at pressures greater than 32 hPa. 

Figure 2 displays the mean age data used in this analysis. The tropical profiles come from three OMS 
balloon C0 2 profiles measured in February and November, 1997 (Andrews et al., 2001) and from two 
balloon C0 2 profiles measured in June, 2005 (A. Engel, pers. comm.). The average profile from these two 
data sets (red) and its la uncertainty are used in the analysis. The mean midlatitude mean age and 
uncertainty profiles were derived from C0 2 and SF 6 data from 27 balloon flights, 32°N-51°N, spanning 30 
years (Engel et al., 2009). The bottom panel of Figure 2 shows the 50 hPa (~20 km) mean age as a 
function of latitude. It was derived from numerous aircraft flights in different months but does not 
represent an annual mean as well as the midlatitude profile does. Nevertheless, these data sets 
represent our best estimate of the annual-averaged mean age. 
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Although mean age observations do not have the spatial coverage of satellite N 2 0 measurements, they 
do span a broad latitude and altitude range. Figure 3 shows ACE annual mean N 2 0 from the same 
latitudes and altitudes as the mean age observations. Mean age of less than 4.5 years and N 2 0 greater 
than 150 ppb form a compact and nearly linear relationship. There is a slight deviation from linearity at 
mean age <0.5 yr but the relationship is still compact. Where mean age is >4.5 years, age and N 2 0 are 
uncorrelated. Hall et al. (1999) and Schoeberi et al. (2000) have explained the loss of correlation as a 
result of increasing N 2 0 loss (photochemical exposure) as air ages. This is analogous to the compact 
relationship observed in the LS for N 2 0 and Cl y (Schauffler et al., 2003), which also arises due to 
photochemical aging of air. The uncorrelated region of Figure 3 comes only from measurements in the 
NH midlatitudes, 20-7 hPa, where mean age has a constant value of ~4.8 yrs. The large variation of N 2 0 
while mean age is constant suggests that numerous stratospheric pathways with the same mean age but 
with different photochemical exposure histories bring parcels to this region. For mean ages <4.5 years 
the same linear relationship exists for both tropical (red) and extratropical (blue and green) points 
considered separately, suggesting that for the extrapolar lower stratosphere where N 2 0 >150 ppb (i.e., 
for regions where very old air does not make up a large fraction of the age spectrum), N 2 0 can be used 
as a proxy for mean age. This relationship makes N 2 0 an especially useful transport tracer because of 
the availability of global, multi-year data sets. 

We have tested the fit of the N 2 0/mean age relationship separately for ACE and MLS N 2 0 observations 
and found only very small differences in the fitted slope and offset. This is not surprising given the close 
agreement seen in Fig. 1 from 20-90 hPa. The use of ACE measurements is advantageous because it 
extends the analysis of the N 2 0/mean age relationship into the lowermost stratosphere (150 hPa) and 
its mean absolute difference with other satellite N 2 0 measurements is ±10 ppb in the LS (Strong et al., 
2008). The use of MLS observations has some disadvantages in that there are no retrievals below 100 
hPa and the systematic uncertainty at 100 hPa is large, about 25% (Livesey et al., 2007). In addition, MLS 
N 2 0 has an unrealistic inverted profile in the tropical LS (a local maxima at 70 hPa) that is not seen by 
ACE or in aircraft data. This causes the scatterplot of MLS N 2 0 and mean age to deviate sharply from the 
fitted line below mean age of 0.6 yrs. ACE and MLS N 2 0 have the greatest disagreement at 100 hPa (~10 
%), where MLS is low biased compared to ACE (Strong et al., 2008). For these reasons we use the ACE 
N 2 0/mean age relationship for all analyses. 
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3, The physical basis for the mean age/N 2 0 relationship 


The relationship between N 2 0, the age spectrum, and mean age can be explored using a well-evaluated 
chemistry transport model with credible lower stratospheric transport (Strahan et al., 2007). The Global 
Modeling Initiative (GMI) chemistry and transport model (CTM) was integrated for 5 years using GEOS4- 
GCM meteorological fields ('G4GCM') with boundary conditions and sea surface temperatures (SSTs) for 
1994-1998; this run is referred to as GMI-G4GCM. The G4GCM meteorological fields were produced by 
the GEOSCCM (Pawson et al., 2008), one of the models evaluated in the SCR. An age tracer experiment 
was also run with the GMI CTM by recycling the 5-year set of G4GCM meteorological fields to complete 
a 20-yr simulation. Lower stratospheric transport in this simulation was evaluated by comparisons with 
MLS N 2 0 and 0 3 in Strahan et al. (2007). They showed that this CTM simulation was able to closely 
match means, variability, and annual cycles in both species in the LS. Figure 4 compares mean age from 
the GMI-G4GCM with mean age observations shown in Figure 2. Model mean ages are realistic in the LS 
at most latitudes but slightly young in the SH midlatitude (upper left panel) and at altitudes above 50 
hPa in the NH midlatitudes (upper right panel). The tropical-midlatitude age gradient (lower right panel), 
which is a measure of tropical ascent rate (Neu and Plumb, 1999), is within the uncertainty range of the 
observations suggesting that the ascent rate is about right from 100-10 hPa. Tropical mean ages (lower 
left panel) above 50 hPa may be slightly low, which could result from too much tropical isolation or 
slightly fast ascent rates. Overall, the GMI-G4GCM mean age comparisons shown in Figure 4 suggest 
that this simulation has generally realistic circulation and mixing in the tropics. 

Because of the demonstrated realism of LS transport in the GMI-G4GCM simulation, we use its age 
spectra to show how N 2 0 and mean age are physically linked. Figure 5 shows the model's age spectra 
from 68°S-68°N, 100-10 hPa. Age spectra in the tropics show a large pulse that propagates upwards, 
reflecting the recent arrival of tropospheric air (i.e., young age elements). Moving upwards in the 
tropics, the pulse's amplitude decreases and the size of the tail increases as older air mixes into the 
tropics during ascent. In the extratropics at 52 hPa and above, a larger percentage of the age elements 
are older than 2 yrs, indicating that air may not have arrived directly from the tropics but instead 
through the downward branch of the Brewer-Dobson circulation. In each panel, the area to the left of 
the vertical dashed line shows the youngest 1.8 yrs of the spectrum; this fraction of the spectrum was 
found to have a nearly linear relationship with mean age for levels 37 hPa and below. The mean age of 
each spectrum (black) and the percentage of age elements <1.8 years (red) are also shown in each 
panel. 
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Figure 6 shows that mean age and N 2 0 are related through the percentage of young air in a parcel in the 
GMI-G4GCM simulation. In each panel, the black points show the correlation over the entire model 
domain (surface to mesosphere, pole to pole) while the red points represent only the domain of the 
observations (150-30 hPa, 68°S-68°N). Figure 6a shows that if we consider only points in the observed 
domain (red), we see a line that closely resembles the correlated part of the observed N 2 0/mean age 
relationship shown in Figure 3. Figure 6b shows the relationship between model mean age and the 
percentage of young age elements. The monotonic relationship found in the domain of the observations 
(red) indicates that LS mean age is directly proportional to the contribution from young air. The link 
between young air and l\l 2 0 is demonstrated in Figure 6c. While the model NH and SH have slightly 
offset red curves, both curves show a compact and nearly 1:1 relationship between N 2 0 and young air in 
the domain of the observations. Above 30 hPa in the extratropics, where N 2 0 <100 ppb, descent of older 
air that has experienced photochemical loss contributes significantly to the age spectrum, breaking 
down the mean age/N 2 0 relationship. Age spectra from this region (e.g., the top row of Fig. 5) have a 
long tail, suggesting sufficient transport time for air to have passed through photochemical loss regions. 

In the model LS, N 2 0 shows a monotonic relationship with the percentage of young age elements. In 
these regions, air parcels contain at least 5-10% young air. These results explain why LS N 2 0 can be used 
as a proxy for mean age up to ages of ~4-4.5 yrs. When there is less than 5-10% young air in the 
spectrum, the tail has a controlling influence on the mean age, and because old air in the tail has little or 
no N 2 0, the correlation is lost. 


4. Interpretation of Mean Age and N 2 0 in Chemistry Climate Models 

We now examine N 2 0 and mean age in the CCMs that participated in the 2010 WMO ozone assessment 
in order to assess their LS transport characteristics. Table 1 lists the CCMs analyzed here; details of the 
participating models and additional references can be found in the SCR and in Morgenstern et al. (2010). 
The analysis presented in this section uses zonal monthly mean output of N 2 0 and mean age from the 
last decade of the REF-B1 simulation of the recent past (~1960-2006). Three of the participating CCMS 
did not provide mean age output and thus cannot be evaluated here (CCSRNIES, E39CA, and EMAC). 

Figure 7 compares the observed global N 2 0/mean age relationship with 15 CCMs for the LS (150-30 hPa, 
68S-68N); this is the same data and uncertainty shown in Figure 3. Poleward of 60°S, model output is 
restricted to 50-150 hPa. The interpretation is fairly simple when the model points (red) fall on a curve 
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that is flatter or steeper than the observations (black). For example, most of the top two rows of Figure 
7 (AMTRAC3, CNRM-ACM, Niwa-SOCOL, SOCOL, and UMETRAC) show maximum mean age less than 
observed while showing a minimum N 2 0 lower than observed. This relationship can only be achieved by 
a circulation that is too fast. The fast circulation quickly transports N 2 0 upward in the stratosphere 
where it is photochemically destroyed, resulting in a low mean age for a given value of N 2 0. The bottom 
row (LMDZ and the UMUKCA models) shows model curves that are steeper than observed, have 
maximum mean age that is 1-3 years older than observed, and N 2 0 that is lower than observed. High 
mean age may be the result of a slow circulation, but may also be caused by too much recirculation 
(mixing) between the tropics and midlatitudes. 

While the observations show a nearly linear relationship throughout this LS domain (Fig. 3), not all 
models do. AMTRAC3, CNRM-ACM, and LMDZ show a large change in slope, suggesting that the 
N 2 0/mean age relationship is breaking down. The observations show that this occurs when mean age is 
greater than 4.5 yrs and N 2 0 is less than 150 ppb (Fig. 3) - ~30 hPa and above in the midlatitudes. The 
loss of correlation in the models suggests that parcels found near the top of this domain (30 hPa) have 
already spent significant time at high altitudes where N 2 0 is destroyed. Alternatively, low N 2 0 could be 
caused by a photochemical error. 

UMSLIMCAT and the models in rows 3 and 4 have curves that agree better with the observed slope, 
however, they do not all show the same endpoint as the observations (i.e., the same maximum age and 
minimum N 2 0). Further insight into these models requires a look at tropical transport behavior. 

Figure 8 assesses ascent rate and horizontal mixing by examining the N 2 0 and mean age profiles in the 
tropics. The rightmost column shows the tropical subset of N 2 0/mean age points shown in Figure 7. The 
first three columns of Figure 8 show three different profile comparisons with observations that diagnose 
the effects of 1) ascent and mixing together (tropical mean age), 2) tropical ascent rate (horizontal mean 
age gradient), and 3) horizontal mixing (tropical N 2 0). Tropical mean age (column 1) increases with 
height as a function of both the ascent rate and the horizontal mixing strength. The agreement of model 
and observed tropical mean age only shows that the combined effects of ascent and mixing produce a 
realistic mean age in the model. Columns 2 and 3 identify how ascent and horizontal mixing each 
contributes to the overall tropical transport seen in column 1. These comparisons diagnose whether a 
problem lies with circulation or mixing, or both. 
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The horizontal age gradient profile (column 2) is an empirical measure of ascent rate because tropical 
and midlatitude mean ages are affected equally by horizontal mixing across the subtropics (Neu and 
Plumb, 1999; SCR). When the horizontal age gradient is less (greater) than observed, the ascent rate is 
too fast (slow). The comparison of modeled and observed tropical N 2 0 profiles (column 3) is a measure 
of the cumulative degree of horizontal mixing into the tropics as air ascends. Below 30 hPa, tropical N 2 0 
has essentially no loss so the decreasing mixing ratio is due only to horizontal mixing. Model N 2 0 that is 
higher (lower) than observed can be interpreted as too little (much) mixing of older air into the tropics. 
The tropical N 2 0 differences are normalized to zero at 100 hPa and the yellow shading indicates the 
range of uncertainty in the LS based on validation against several satellite instruments (Strong et al., 
2008). If midlatitude N 2 0 is biased high or low in a model, this diagnostic will have to be considered in 
light of the bias. For example, if the observed-model N 2 0 profile were good but the model's midlatitude 
N 2 0 was known to be low-biased, this would imply that mixing was actually too weak. 

All models in Figure 8a are too young in the tropics, particularly above 70 hPa. CNRM-ACM, Niwa- 
SOCOL, SOCOL, and AMTRAC3 have fast ascent rates. They all have about the right amount of horizontal 
mixing except SOCOL, which has borderline too much mixing above 50 hPa. CNRM-ACM has the 
youngest mean age and fastest ascent of any of the CCMs. UMETRAC has young tropical mean age yet 
has a good ascent rate and good horizontal mixing. Neu and Plumb (1999) note that the horizontal mean 
age gradient is a good measure of ascent rate only in the case of small vertical diffusion. Model vertical 
diffusion that is too strong may explain why mean age is uniformly young throughout the UMETRAC 
stratosphere. Its midlatitude N 2 0 is not biased and so should not affect the interpretation of the mixing 
diagnostic. 

The models in Figure 8b show the best agreement overall with all tropical mean age and N 2 0 diagnostic 
quantities, indicating they have the best ascent rates and horizontal mixing of the CCMs. CMAM has 
slightly rapid ascent near 50 hPa, but otherwise shows very good tropical transport behavior. 
UMSLIMCAT has variable ascent rates, a bit slow below 50 hPa and a bit fast above, but the overall 
tropical transport is good. Both CMAM and UMSLIMCAT are slightly young at 30 hPa. The CAM3.5 age 
gradient indicates fast ascent, but tropical mean age and mixing appear correct. This is explained by the 
high-biased CAM3.5 midlatitude N 2 0. While the mixing diagnostic (column 3) suggests mixing is about 
right, it must actually be too strong because its midlatitude N 2 0 is too high, therefore more of it must 
mix in to sufficiently lower the tropical mixing ratio. In this case, tropical mean age comes out right 
because the fast ascent (which decreases age) is compensated by excessive mixing (which increases 
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age). This is consistent with the WACCM results. WACCM has very good agreement with observed 
midlatitude N 2 0, and WACCM and CAM3.5 use the same atmospheric GCM. WACCM shows borderline 
fast ascent and too much mixing at 50 hPa and above - these effects compensate to give WACCM a 
good tropical mean age profile very much like CAM3.5. GEOSCCM is a little slow in ascent below 50 hPa 
but otherwise agrees very closely with the diagnostics. 

The models in Figure 8c show a mixture of tropical transport problems. MRI produces a good tropical 
mean age profile at all levels but the ascent and mixing diagnostics indicate that this is fortuitous. MRI 
ascent and mixing look good from 100-50 hPa, but above 50 hPa mixing is too strong while ascent is too 
fast. The results for ULAQ are similar, but ULAQ has larger variations in ascent rate: much slower 100-70 
hPa and much faster above 50 hPa. Like MRI, the fast ascent combined with strong mixing results in a 
good tropical mean age profile due to compensating effects from two physically unrealistic processes. 
LMDZ has very slow ascent from 100-50 hPa. Above 50 hPa, the ascent rate appears correct and mixing 
is good at all levels. The UMUKCA models show very slow ascent at all levels, allowing more time for 
horizontal mixing as the air ascends. However, the simple interpretation of the mixing diagnostic is 
incorrect due to the very low-biased midlatitude N 2 0 found in both models. When low N 2 0 mixes into 
the tropics, less mixing is required to produce the 'correct' tropical N 2 0. We can only conclude that slow 
ascent leads to much older than observed age. The results presented here are consistent with the 
transport conclusions drawn in the SCR. 

5. Discussion 

5.1 Transport Effects on Q< in the Lower Stratosphere 

In this section we examine models' LS 0 3 profiles to see if their agreement with observations is related 
to their transport characteristics. We include all models that submitted simulations for the recent past 
('REF-B1') and future ('REF-B2') scenarios. At all latitudes above 30 hPa and in the polar LS, 0 3 is 
influenced by processes in addition to transport, e.g., gas and condensed phase chemistry and 
temperature. Thus, the best place to look for a link between ozone and transport is in the extrapolar LS. 
Restricting the comparison to pressures of 150-30 hPa reduces but does not eliminate the influence of 
photochemistry. Also, model representation of polar loss processes or the polar vortex transport barrier 
may affect midlatitude profiles. 

Figure 9 shows the difference between annual mean 0 3 profiles from Aura MLS and 16 CCMs for the SH 
and NH midlatitudes and the tropics. The model annual means are a 10-yr average calculated from the 
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last 10 years of the REF-B1 run, usually 1997-2006. Although the CCSRNIES model did not submit the 
necessary mean age output for the transport diagnostic, the SCR transport evaluation characterized 
CCRSNIES as having slightly fast ascent rate and too much subtropical mixing. Annual zonal mean was 
calculated from 4 years of Aura MLS v2.2 0 3 (Livesey et al., 2007); precision error is negligible because of 
the large number of profiles in the average. The uncertainty of MLS 0 3 as a function of pressure ranges 
from up to 30% in the lowermost stratosphere to 5-8% at 50 hPa and above. MLS systematic 
uncertainties are indicated by the shaded area about the zero line. The top panels show the differences 
in mixing ratio while the bottom panels show the differences as a percentage of observed 0 3 ; the use of 
percentage makes it easier to examine 0 3 differences where mixing ratios are small (below ~70 hPa). 

The top panels extend to higher levels than the bottom panels to illustrate some of the large 0 3 
differences found in the photochemically active region. No model agrees with MLS within the 
uncertainties at all locations shown. The five models with the best representation of circulation and 
mixing, identified by their tropical N 2 0/mean age relationship in Fig. 8b, are plotted in red. Overall, 
these models show closer agreement with MLS 0 3 than most models, and spread among them is small. 
The five models shown in blue or green have the greatest LS transport problems, either very fast ascent 
with weak mixing (CNRM-ACM), very slow ascent (UMUKCA-METO and UMUKCA-UCAM), or combined 
ascent and mixing problems (MRI and ULAQ). Their performances are consistent with their LS transport 
diagnoses. Below the 0 3 maximum, slow ascent and excessive mixing each act to increase mean age and 
hence 0 3 : the UMUKCA models, MRI, and ULAQ all have higher than observed 0 3 . (For ULAQ this is only 
true in the tropics). Fast ascent produces younger mean age and lower ozone, and CNRM-ACM 
consistently has too low 0 3 . 

The six models shown in black (AMTRAC3, CCSRNIES, LMDZrepro, Niwa-SOCOL, SOCOL, and UMETRAC) 
have identifiable transport deficiencies yet they compare as well with observations from 100-50 hPa as 
do the models with realistic transport. Clearly good agreement with observed LS 0 3 by itself is not 
evidence of good transport. The top center panel shows that three of the six models shown in black 
have much higher than observed tropical 0 3 near 30 hPa (CCRSNIES, Niwa-SOCOL, and SOCOL), 
suggesting a possible chemistry problem. Figure 9 demonstrates that the LS transport diagnostics are 
able to physically link poor LS 0 3 profiles with transport behavior only in cases where the tropical 
transport is quite poor. 
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5.2 Impact of Transport on the Ozone Column 


Roughly half of the ozone column resides in the LS, but to what degree can transport diagnostics explain 
the entire model column? Figure 10 shows the annual global mean (60°S-60°N) of the column 0 3 
anomaly (with respect to 1980) from the REF-B2 (future) simulation for 15 CCMs and observed annual 
mean column 0 3 anomaly from the Merged Ozone Data Set (Stolarski and Frith, 2006). Model 0 3 
columns may be biased because most CCMs do not explicitly calculate tropospheric ozone; most 
comparisons in this paper will be made with column anomalies. Differences in CCMs approaches to 
tropospheric 0 3) as well as differences between the CCMs in the implementation of the REF-B2 forcings, 
such as SST projections, will have some impact on model 0 3 predictions; details of these differences can 
be found in Morgenstern et al. (2010). E39CA, EMAC, and UMETRAC did not a submit REF-B2 simulation 
and thus are not included in the remainder of the discussion. 

Figure 10 shows several interesting results. First, there are two models with very similar transport 
diagnostics (AMTRAC3 and Niwa-SOCOL, Fig. 8a) that showed 'reasonable' agreement with present day 
0 3 profiles yet make widely different predictions of the return to 1980 ozone values. These models, 
shown in black, are the first and the last to cross the zero line (2004 and 2060). This large difference in 
return date in spite of their similar LS 0 3 profiles and their similar transport diagnostics shows that these 
models must have a different balance of processes controlling 0 3 . That is, transport and chemistry (and 
implicitly temperature) have different-sized terms in their ozone tendency equations. Time-varying SSTs 
and source gas forcings in the future scenario cause changes to a model's transport circulation and 
chemistry. When a model responds to a perturbation, e.g., a change in SST or source gases, its 0 3 
change depends on the sensitivity of each term in the 0 3 tendency equation to that change. Two models 
having the same 0 3 profile but different chemistry and transport terms are unlikely to have the same 
response to a perturbation. Differences in sensitivities will result in different size effects on transport 
and chemistry terms for each model, and thus there is no reason to expect that models with the same 
0 3 profile today would predict the same response to a perturbation, i.e., an 0 3 profile 50 years from 
now. By evaluating model transport processes in the LS we are effectively trying determine whether a 
model's 0 3 transport tendency terms (horizontal and vertical) are realistic and how those terms affect a 
model's prediction of future 0 3 . 

This figure also shows that the ozone 60°S-60°N return dates for the set of CCMs do not sort out by 
transport characteristics alone. For example, the four models with fast circulations (AMTRAC3, 
CCSRNIES, Niwa-SOCOL, and SOCOL, all in black) have return dates spanning the full range of the CCM 
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predictions, 2004-2058, yet the two models (UMUKCA) with the slowest circulations have return dates 
near 2025, earlier than the median (2030) or mean (2032) of the 15 CCMs. 

Finally, this figure shows that the five CCMs with the best LS transport characteristics (red) have a 
narrow range of predicted return dates (2026-2040). The random selection of 5 return dates from this 
figure would result in a range of 14 years or less only 7% of the time, so this reduction is unlikely to be 
due to simply reducing the number of results compared. This suggests two things. One, that similar, 
credible LS transport leads to similar 0 3 predictions, and two, that much of the large range of predicted 
0 3 return dates may be the result of problems with model transport. However, the 0 3 tendency 
equation has a number of chemical terms and thus far the role of chemistry has not been considered. 
We expect that physically realistic transport processes are essential for realistic 0 3 simulations, but we 
cannot conclude that the narrow range of predicted return dates is the result of good transport unless 
the models also have realistic chemistry. 

5.3 Role of Chemistry 

Chapter 6 of the SCR evaluated chemistry in CCMs by looking at radicals and radical precursors, tracer- 
tracer correlations, and photolysis rates. This section briefly summarizes major issues affecting 0 3 that 
were identified in the chemistry chapter. It is beyond the scope of this paper to quantify how each 
model's column 0 3 levels are influenced by each contributing process, e.g., transport, temperature, 
losses due to various families such as HO x , NO x , and CIO x ). 

The SCR photochemical intercomparison ('PhotoComp') evaluated the accuracy of J-values, but only 9 
CCMs participated. AMTRAC3, CCSRNIES, and Niwa-SOCOL showed the greatest inaccuracies, while 
UMSLIMCAT, WACCM, GEOSCCM, and LMDZ had highly accurate J-values (SCR, 2010). Photochemical 
steady state (PSS) model comparisons were made for both radicals and radical precursors. Niwa-SOCOL 
did not participate in the PSS comparisons, but it uses photolysis and chemistry schemes that are very 
similar to SOCOL (SCR, 2010). CCSRNIES, CNRM-ACM, and SOCOL had the most disagreements with 
observations for the precursors, and MRI and SOCOL had poor agreement with some radicals (e.g. O'O, 
HO x , NO x /NO y , or CIO/Cl y ). AMTRACS, CAM3.5, MRI, SOCOL, and ULAQ have a problem with the CIO/Cl v 
ratio, and a problem with the Cl y vs N 2 0 relationship was found for AMTRAC3, CNRM-ACM, and SOCOL. 
CCSRNIES, Niwa-SOCOL, SOCOL, and UMUKCA-METO have total chlorine in the upper stratosphere that 
exceeded total Cl emitted at the surface, indicating a lack of conservation of Cl y . (The UMUKCA-METO 
model has excessive Cl y due to mistreatment of tropospheric HCI removal.) The PSS comparison 
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identified no major issues with radicals or precursors for CMAM, EMAC, GEOSCCM, LMDZ, UMSLIMCAT, 
or WACCM. 

Models with both transport and chemistry problems have the potential to produce compensating errors 
that result in fortuitous agreement with 0 3 columns. Of thelO CCMs that agree to within 5% of observed 
annual mean 60°S-60°N ozone columns from 1979-2009 (the green lines in Figure 13), five have 
transport and chemistry problems. Niwa-SOCOL has a fast circulation, lack of conservation of Cl y , and 
inaccuracies in its J-values. CNRM-ACM and SOCOL have fast circulations and disagreements with radical 
precursors, and SOCOL also lacks Cl y conservation. ULAQ has a mixture of circulation and mixing 
problems and a problem with CIO/Cl y . AMTRAC3 has a fast circulation and mixed accuracy for J-values. 
Problems were found in its CIO/Cl y and Cl y /N 2 0 relationships, which are probably due to the 
parameterization used to calculate Cl y . 

Of the five models that demonstrated very good LS transport, none had any major chemistry problems, 
although CAM3.5 has minor problems with CIO/Cl y and NO x /NO y . All five except for UMSLIMCAT also 
agree closely with column 0 3 observations shown in Figure 13. It is worth nothing that the LMDZ model 
performed very well in the photochemical, radical, and precursor evaluations and its only identified 
transport deficiency is slow ascent in the tropics below 50 hPa. As tropical LS ozone is typically very low 
(<2 ppm), it is possible that slow ascent below 50 hPa has only a small impact on its 60°S-60°N column 
mean. The LMDZ return date is within range predicted by the five models with the best LS transport. 

5.4 Return to 1980 Column On in the Antarctic 

In addition to transport and gas phase chemistry, the temporal and spatial distribution of PSC-producing 
temperatures, heterogeneous chemistry, and the presence of mixing barriers also affect polar ozone 
columns. Figure 11 shows October Antarctic column 0 3 and column 0 3 anomalies relative to 1980 for 15 
CCMs. The red lines represent models with good LS transport, black lines are models with good 
agreement with 0 3 profiles but known transport problems, and the green lines are models with known 
transport problems that have poor agreement with 0 3 profiles. The top panel shows that the models 
that most closely agree with observations (to within ±10%) include models with good transport and 
models with poor transport. The same is true for the models that are biased high. Good agreement with 
column 0 3 observations is not, by itself, an indication of realistic transport. 

The bottom panel shows column 0 3 relative to each model's own 1980 value from 1960-2080. The 5 
models with transport problems and poor agreement with 0 3 profiles (green) have the latest return 
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dates of the 15 CCMs. The proximate cause for four of them is vortex Cl y that is as high or higher than 
most models (only MRI has <3.0 ppb Cl y ). In addition to their previously identified transport problems, 
these five models also have additional problems affecting polar ozone loss processes. CNRM-ACM, MRI, 
and ULAQ lack the physical containment necessary to sequester high Cl y /low NO y air in the vortex, yet 
CNRM-ACM and ULAQ produce more than 3 ppb at 80°S (SCR, Ch. 5). CNRM-ACM has a Cl y non- 
conservation problem in the LS (Ch. 6, SCR). All five models except CNRM-ACM have a problem with the 
CIO/Cl y ratio. The UMUKCA models have good vortex isolation but have Antarctic spring temperatures 
that are 4-7K higher than the climatological mean over the 200-50 hPa region (SCR, Ch. 4). The later- 
than-average return dates for the UMUKCA models are consistent with their slow circulations. In spite of 
having only 2.4 ppb Cl y in the present day vortex, MRI has the latest Antarctic return date of any CCM 
(2067). The likely cause for this is a missing CIO loss reaction (SCR, Ch. 6). This leads to higher levels of 
CIO for a given level of Cl y , allowing greater polar ozone loss to occur in spite of low Cl y . 

The red and black lines in the bottom panel of Fig. 11 form two groups with a similar spread of return 
dates, about 20 years each. The black group, representing models with transport problems but 
reasonable 0 3 profiles, is shifted ~6 years earlier than the red group (the models with good transport 
and good profile agreement). The difference between the two groups may reflect the faster circulation 
of most of the black group models and generally lower LS Cl y levels found in them (SCR, Ch. 5). AMTRAC3 
is an exception to this, having high vortex Cl y (3 ppb) and a return date of 2050, comparable to the 
return dates of the green group models, most of which also have 3 ppb or higher Cl y . These results point 
to a relationship between vortex Cl y and return date. 

Figure 12 compares the Antarctic return date with each model's simulated October 2005 mean 80°S Cl y . 
There are two important results. The first is that the Antarctic return date is related to vortex Cl y , and 
the higher the present day Cl y simulated, the later the return date. This is true for 12 of the CCMs 
examined. The exceptions are Niwa-SOCOL, SOCOL, and MRI. Each has a problem with CIO/ Cl y and a 
much later return date than expected based on their 80°S Cl y . While UMUKCA-METO is not an exception, 
it is worth noting a known problem in this model's removal of tropospheric HCI results in excessive 
stratospheric Cl y , affecting where vortex Cl y falls on this line. The second result is that the faster 
circulation models (green) tend to have low Cl y (~1.8-2.4 ppb) while the slower circulation models (blue) 
have higher Cl y (2. 9-3. 8 ppb), so return date is generally related to the model's circulation strength. 

There are exceptions. As previously discussed, AMTRAC3 and CNRM-ACM both have high Cl y at 80°S 
inconsistent with their fast circulation, giving them return dates similar to other high Cl y simulations. For 
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AMTRAC3 this may be related to the parameterization used for production of Cl ¥ from CFCs. For CNRM- 
ACM, high Cl y due to lack of conservation was found in the lowermost stratosphere, but stratospheric 
temperatures and jet locations and strengths are also reported to be very biased (Chapters 4 and 6, 

SCR). In summary, the higher the Cl y , the later the Antarctic return date, but Cl y in the vortex is not solely 
influenced by transport and CIO/Cl y matters as well. 


6. Conclusions: Understanding Model Predictions 

Figure 13 shows 60°S-60°N column 0 3 and column anomalies for 15 CCMs. The top panel shows annual 
zonal mean column 0 3 observations along with 15 CCMs; the models that agree to within 5% of 
observations (~15 DU) are plotted in green. The middle panel shows column anomalies, indicating when 
each model returns to its 1980 60°S-60°N mean column value. The 10 models that agree best with global 
mean observations (green) span the full 54-yr range of return dates (2004-2058). No reduction in the 
range of predictions is found by selecting models based on their ability to reproduce ozone 
observations, nor is there any reduction caused by comparing a smaller set of results (10 rather than 
15). A random selection of 10 return dates would result in a reduction of the range 57% of the time. The 
bottom panel of Figure 13 shows the same column anomalies, where the models having the best LS 
transport are plotted in red. The best LS transport was determined by N 2 0 and mean age agreement 
(Figures 7 and 8). The spread of predicted recovery dates is reduced to 14 years (2026-2040) when 
credible LS transport is considered. This reduction is unlikely to occur solely due to the reduction of the 
sample size (5 CCMs instead of 15), which would produce a range of more than 14 years 93% of the time 
when selected randomly. The models shown in black have identifiable problems with transport, such as 
fast or slow ascent, inappropriate tropical or vortex isolation, problems with radical/reservoir ratios 
(e.g., CIO/Cl y ), or lack of Cl y conservation. 

Column ozone is influenced by so many processes that a simulated column may be close to the observed 
value due to compensating effects from multiple problems. For this reason, column 0 3 should not be 
used as a surrogate for model performance. Similarly, CCMs with good LS transport and good 
stratospheric chemistry will not necessarily produce good agreement with observed columns for several 
reasons, including lack of tropospheric chemistry, problems with upper stratospheric temperatures, and 
heterogeneous chemical processes. The primary focus of this paper is the effect of transport on 0 3 in 
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CCM simulations. Other issues affecting 0 3 were evaluated in the SCR; Morgenstern et al. (2010) 
discusses the approximations used for tropospheric chemistry in these CCMs. 

Ozone recovery in CCMs is a result of decreasing CFCs that are forced by the mixing ratio boundary 
conditions used in the future scenario. The chemistry and transport evaluations that are applied to 
CCMs provide the means to explain much of the variation in return-to-1980 dates. Overall, Antarctic 
recovery is a function of LS Cl y , and Cl y levels are to a large extent controlled by transport. The models 
with LS transport in agreement with observations (i.e., N 2 0 and mean age) show a reduced range of 
return dates. Models with slow circulation have high Cl y and a later recovery, but some models have 
high Cl y due to a parameterization or a conservation issue. One model has low Cl y but has a missing CIO 
loss reaction, so its recovery is similar to models with high Cl y . Except for AMTRAC3, all the models with 
a fast circulation (shown in green in Fig. 12), also have a problem with excess Cl v in the UT/LS (Ch. 6, 
SCR). The full range of recovery dates in the Antarctic for the 15 CCMs is 2028-2067 (39 years), but the 
range for models that do not have a Cl y conservation problem is 2037-2067 (30 years). Transport 
problems also contribute to this range: slow circulation in the UMUKCA-UCAM model, which results in 
high vortex Cl v , is responsible for this model's 2067 predicted return. The range of predicted return 
dates is 2037-2056 (19 years) for models whose chemistry and transport have been shown to be free of 
major disagreement with observations. 

While much uncertainty in prediction exists because we can only guess what future ODS and GHG 
emissions will be, the current range of predicted return dates is unnecessarily large due to identifiable 
model transport and chemistry deficiencies. This study shows that much of the range of future ozone 
predictions can be directly attributed to identifiable modeling problems in chemistry and transport. It is 
remarkable, and perhaps an encouraging sign of progress in chemistry climate modeling, that a handful 
of models having different dynamical cores, transport schemes, chemical solvers, spatial resolutions, 
etc., all produce very similar lower stratospheric 0 3 and column 0 3 return dates. The explanation for the 
similar behavior in these models is suggested by the transport diagnostic presented in this study: these 
models have credible representation of important physical processes that affect the distribution of 
ozone and other trace constituents involved in 0 3 chemistry. Having models with credible physical 
processes in the lower stratosphere that simulate similar 0 3 behavior increases our confidence in the 
current understanding of the essential chemical and dynamical processes controlling ozone. 
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Table 1. Chemistry Climate Models participating in CCMVal and WMO (2011) 


CCM Name 

Atmospheric GCM 

Reference 

Horiz. Res. of Advection, 
Levels, Top 

AMTRAC3 

AM3 

Austin and Wilson (2008) 

~200 km, L48, 0.017 hPa 

CAM3.5 

CAM 

Lamarque et al. (2008) 

1.9° x 2.5°, L26, 3.5 hPa 

CCSRNIES* 

CCSR/NIES AGCM 5.4g 

Akiyoshi et al. (2009) 

T42L34, 0.012 hPa 

CMAM 

AGCM3 

Scinocca et al. (2008) 

T31L71, 0.00081 hPa 

CNRM-ACM 

ARPEGE-Climate v4.6 

Deque (2007); Teyssedre 
et al. (2007) 

T42L60, 0.07 hPa 

E39CA*t 

ECHAM4 

Stenke et al. (2008, 2009) 

T30L39, 10 hPa 

EMAC vl.6*t 

ECHAM5 

Joeckel et al. (2006) 

T42L90, 0.01 hPa 

GEOSCCM v2 

GEOS5 

Pawson et al. (2008) 

2° x2.5°, L72, 0.015 hPa 

LMDZrepro 

LMDz 

Jourdain et al. (2008) 

2.5° x3.75°, L50, 0.07 hPa 

MRI 

MJ98 

Shibata and Deushi 
(2008a, b) 

T42L68, 0.01 hPa 

Niwa-SOCOL 

MAECHAM4 

Schraner et al. (2008) 

T30L39, 0.01 hPa 

SOCOL 

MAECHAM4 

Schraner et al. (2008) 

T30L39, 0.01 hPa 

ULAQ 

ULAQ-GCM 

Pitari et al. (2002) 

R6L26, 0.01 hPa 

UMETRAC+ 

HadAM3 L64 

Austin and Butchart 
(2003) 

2.5°x3.75°, L64, 0.01 hPa 

UMSLIMCAT 

HadAM3 L64 

Tian and Chipperfield 
(2005) 

2.5°x3.75°, L64, 0.01 hPa 

UMUKCA-METO 

HadGEM-A 

Morgenstern et al. 
(2008,2009) 

2.5°x3.75°, L60, 84 km 

UMUKCA-UCAM 

HadGEM-A 

Morgenstern et al. (2009) 

2.5°x3.75°, L60, 84 km 

WACCM 

CAM 

Garcia et al (2007) 

1.9°x2.5°, L66, 6x10 6 hPa 


*No mean age output submitted. 

+No REF-B2 (future scenario) submitted. 
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Figure 1. Top: Annual zonal mean N 2 0 calculated from 5 years of ACE observations, 2004-2008. Middle: 
Annual zonal mean N 2 0 calculated from 5 years of Aura MLS N 2 0 observations, 2005-2009. Bottom: 
(MLS-ACE)/ACE percentage difference. Lowest MLS N 2 0 level retrieved is 100 hPa. 
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Tropical and NH Midlatitude Mean Age 



Mean Age from Balloon Data (Years) 


Mean Age at 20 km 



Figure 2. C0 2 and SF s -derived mean age observations used in this study. Top: Tropical and midlatitude 
(35-50°N) mean profiles and their la uncertainties (Andrews et al., 2001, and Engel, pers. comm.). The 
Engel et al. (2009) midlatitude mean is an average of 27 profiles. Bottom: The Andrews et al. (2001) 
aircraft C0 2 mean age and lo uncertainty. 
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All Midlatitude and Tropical Observations 
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Figure 3. Scatterplot of all mean age observations and their co-located climatological ACE N 2 0 mixing 
ratios. The points between 0-4.5 yrs mean age and 150-315 ppb N 2 0 have been fit to a line. The yellow 
shading shows the range of uncertainty for the mean age and N 2 0 observations with respect to the 
fitted line. Blue points are the 20 km (50 hPa) measurements, green points are from the midlatitude 
profile below 20 hPa, and red points are from the tropical profile at 10 hPa and below. A breakdown in 
the correlation between mean age and N 2 0 is found at 20 hPa and above in the midlatitudes where 
mean age is >4.5 years (orange points). 
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Mean Age at 20 km 
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Figure 4. Mean age from a 20-yr simulation of the GMI-G4GCM (red) and mean age observations (blue 
points or lines with yellow shading for the lo uncertainty. The close agreement with the tropical- 
midlatitude mean age gradient indicates a realistic tropical ascent rate. The good agreement with the 
tropical mean age profile indicates a good balance between ascent and subtropical mixing in the 
simulation. The simulated mean age agrees within the observed uncertainties everywhere except for the 
SH midlatitude LS and the NH midlatitudes 40-10 hPa. 
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Figure 5. Age spectra from the GMI-G4GCM, from 100-10 hPa and 68°S-68°N. The y-axis is spectral 
density/year.The mean age at each location is shown in the upper right corner of each panel. The yellow 
shaded area under the curve shows the part of the spectrum containing the youngest 1.8 years. The 
area under that curve as percentage of all the age elements is shown in red. 
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Figure 6. Relationship in the GMI-G4GCM between a) mean age and N 2 0, b) mean age and the 
percentage of young age elements, c) the percentage of young age elements and N 2 0 over the entire 
model domain (black). Points in red show the relationship over the domain of the observations used in 
this study (68°S-68°N, 30-150 hPa). In each panel, the relationship in the LS domain is nearly linear and 
mostly monotonic. There are slight hemispheric differences, mostly noticeable in the form of two 
slightly offset red curves in panel c). Panel b) shows a linear relationship between mean age and the 
percentage of the mean coming from young elements in the LS domain. There is a unique relationship 
between N 2 0 and mean age in the lower stratosphere that is determined by transport. 
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Figure 7. N 2 0 and mean age relationship in 15 CCMs (red) in the same domain as the observations 
(black). The yellow shaded area represents the uncertainty of the fitted line (from Fig. 3). Model curves 
that fall off faster (slower) than observed indicate a fast (slow) circulation. Models results are sorted by 
fastest circulations at the top, slowest circulations at the bottom. Results are from the REF-B1 simulation 
of the recent past. 
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Figure 8a. Diagnostic plots evaluating tropical ascent and tropical-midlatitude (horizontal) mixing in 
CCMs. Results are from the REF-B1 simulation of the recent past. The first 3 columns show the 
difference between model and observed profiles of 1) tropical mean age (years), 2) the horizontal mean 
age gradient (years), and 3) tropical N 2 0 (ppb). The yellow shading indicates the ±lo uncertainty in the 
observations. The rightmost column compares the simulated (red) and observed (black) mean age/N 2 0 
relationship in the tropical LS. The middle columns diagnose the reason for the (dis)agreement with the 
tropical mean age profile (far left) by identifying ascent and or mixing problems. Fast ascent decreases 
mean age everywhere while strong mixing across the subtropics increases mean age everywhere. A 
dashed red line indicates there is an issue that complicates the interpretation of that diagnostic - see 
text. The 5 CCMs shown have the fastest circulations. 
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Figure 8b. Same as Figure 8a except for the the 5 CCMs with the most realistic tropical circulation and 
mixing. 
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Figure 8c. Same as Figure 8a except for the 5 CCMs with slow circulations or excessive horizontal mixing. 
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Figure 9. Differences between models and observed 0 3 profiles for the SH and NH midlatitudes and the 
tropics. Model results are from the REF-B1 simulation of the recent past. The models with the most 
realistic tropical transport processes (Fig. 8b) are shown in red. The top panels show 0 3 differences in 
ppb while the bottom panels express the difference as a percentage. The models in dark blue have the 
highest 0 3 (slowest circulation and oldest LS mean age), while the model shown in light blue has the 
lowest 0 3 (fastest circulation and youngest mean age). The models shown in dashed green have 
competing transport errors (fast or variable ascent rates and strong mixing). Yellow shading indicates 
MLS 0 3 2 a systematic uncertainty (Livesey, 2007). Most of the CCMs do not have a tropospheric 
chemistry scheme so tropical 0 3 below 100 hPa should not be compared. 
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Figure 10. Annual mean 60°S-60°N column 0 3 anomalies with respect to 1980 for 15 CCMs. Results are 
from the REF-B2 future simulation. Observations are shown with black asterisks. Column anomalies for 
the 5 CCMs with the best representation of LS transport processes, which also have good agreement 
with LS 0 3 profiles, are shown in red. Models with significant transport issues that do not agree with 
0 3 profiles are shown in green and blue (same colors as in Figure 9). The models shown in black have 
transport issues but have 0 3 profiles in reasonable agreement with observations (Fig. 9) 
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Figure 11. October zonal mean column 0 3 (top) and column anomalies with respect to 1980 (bottom) for 
15 CCMs and observations. Results are from the REF-B2 future simulation. Observations are shown by 
black asterisks. As in the previous figures, CCMs in red have the best LS transport as well as good 
agreement with LS 0 3 profiles. CCMs shown in black have problems with LS transport but good 
agreement with LS 0 3 profiles. The 5 CCMs shown in green have the greatest LS transport problems and 
do not agree well with LS 0 3 profiles. 
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Antarctic Return Date 


Figure 12. The relationship between model return-to-1980 date for the October Antarctic column 0 3 and 
model Cl y at 80°S 50 hPa in 2005. Results are from the REF-B1 simulation of the recent past. Models in 
red have good LS transport. Models in green (blue) have fast (slow) LS circulation. Models in black have 
excessive subtropical mixing and some problems with the tropical ascent rate. Models with an orange 
diamond either have a problem with Cl y conservation (MRI, Niwa-SOCOL, and SOCOL), or excessive Cl y in 
the UT/LS (as diagnosed by Chapter 6 of the SCR). MRI, Niwa-SOCOL, and SOCOL also all have diagnosed 
problems with CIO/Cl y ; they are not included in the fit shown. 
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Figure 13. Top: Annual mean 60°S-60°N column 0 3 in 15 CCMs from 1970 to the present. Results are 
from the REF-B2 future simulation. Observations are shown with black asterisks in each panel. The 
models in green show agreement with the observations to within 5%; models in black do not agree with 
observations to within 5%. Middle: the same CCM output and observations, but plotted as anomalies 
with respect to 1980. The time period is 1960-2080. The dashed vertical lines show the earliest and 
latest predicted return dates for the models with the best agreement with observations, which is the 
same as the range of all 15 CCMS (2004-2058). Bottom: the same CCM output but models with the best 
IS transport are shown in red. The dashed vertical lines show the earliest and latest predicted return 
dates for the red models, 2026-2040. 
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